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ABSTRACT 

We propose a network characterization of combinatorial fitness land- 
scapes by adapting the notion of inherent networks proposed for 
energy surfaces [5j. We use the well-known family of NK land- 
scapes as an example. In our case the inherent network is the graph 
where the vertices are all the local maxima and edges mean basin 
adjacency between two maxima. We exhaustively extract such net- 
works on representative small NK landscape instances, and show 
that they are 'small- worlds'. However, the maxima graphs are not 
random, since their clustering coefficients are much larger than 
those of corresponding random graphs. Furthermore, the degree 
distributions are close to exponential instead of Poissonian. We 
also describe the nature of the basins of attraction and their rela- 
tionship with the local maxima network. 

Categories and Subject Descriptors 

1.2.8 [Artificial Intelligence]: Problem Solving, Control Methods, 
and Search — Heuristic methods; G.2.2 [Discrete Mathematics]: 
Graph Theory — Network problems 

General Terms 

Algorithms, Measurement, Performance 

Keywords 

Landscape Analysis, Network Analysis, Complex Networks, Local 
Optima, NK Landscapes 
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A fitness landscape of a combinatorial problem can be seen as a 
graph whose vertices are the possible configurations. If two con- 
figurations can be transformed into each other by a suitable opera- 
tor move, then we can trace an edge between them. The resulting 
graph, with an indication of the fitness at each vertex, is a rep- 
resentation of the given problem fitness landscape. Doye {5\ |6) 
has recently introduced a useful simplification of the fitness land- 
scape graph for the energy landscapes of atomic clusters. The idea 
consists in taking as vertices of the graph not all the possible con- 
figurations, but only those that correspond to energy minima. For 
atomic clusters these are well-known, at least for relatively small 
assemblages. Two minima are considered connected, and thus an 
edge is traced between them, if the energy barrier separating them 
is sufficiently low. In this case there is a transition state, meaning 
that the system can jump from one minimum to the other by ther- 
mal fluctuations going through a saddle point in the energy hyper- 
surface. The values of these activation energies are mostly known 
experimentally or can be determined by simulation. In this way, 
a network can be built which is called the "inherent structure" or 
"inherent network" in (5). We use a modification of this idea for 
studying the well-known NK combinatorial landscapes. In our 
case, a vertex of the graph is a local maximum, and there is an edge 
between two maxima if they lay on adjacent basins. 

In the context of meta-heuristics, it is important to identify the 
features of landscapes that would influence the effectiveness of 
heuristic search. Such knowledge may be helpful for both predict- 
ing the performance and improving the design of meta-heuristics. 
Among the features of landscapes known to have a strong influence 
on heuristic search, is the number and distribution of local optima 
in the search space. An interesting property of combinatorial land- 
scapes, which has been observed in many different studies, is that 
on average, local optima are very much closer to the global op- 
timum than are randomly chosen points, and closer to each other 
than random points would be. In other words, the local optima are 
not randomly distributed, rather they tend to be clustered in a "cen- 
tral massif (or "big valley" if we are minimising). This globally 
convex landscape structure has been observed in the NK family of 
landscapes 1111 . and in many combinatorial optimisation problems, 
such as the traveling salesman problem |2], graph bipartitioning 
1131 . and fiowshop scheduling 1161 . 



In this study we seek to provide fundamental new insights into 
the structural organization of the local optima in combinatorial land- 
scapes, particularly into the connectivity and characteristics of their 
basins of attraction, using NK landscapes as a case study. To 
achieve this, we first map the landscape onto a network, and then 
analyze the topology of this network for a number of small NK 
landscape instances for which complete networks can be obtained. 
Our analysis is inspired, in particular, by the work of Doye (5] [6) 
on energy landscapes, and in general, by the field of complex net- 
works 1 14 20 21 1. The study of complex networks has already 
permeated the evolutionary computation field. Specifically, in the 
study of scientific collaborations (3l 1121 . the structure of a popu- 
lation in cellular evolutionary algorithms f9l llQ|[T31 . and the evo- 
lution of networks of cellular automata |19|. However, our study 
is the first attempt, to our knowledge, of using network analysis 
techniques in connection with the study of fitness landscapes and 
problem difficulty in combinatorial optimization. 

The next section introduces the study of complex networks, and 
describes the main features of small-world and scale-free networks. 
Section[3]describes how landscapes are mapped onto networks, and 
includes the relevant definitions and algorithms. The empirical net- 
work analysis of our selected NK landscape instances is presented 
in Section [4] whilst Section [5] gives our conclusions and ideas for 
future work. 

2. COMPLEX NETWORKS 

The recent interest in the study of networks and networked sys- 
tems was influenced by the seminal paper by Watts and Strogatz 
1211 . who showed that many real- world networks are neither com- 
pletely ordered nor completely random, but rather exhibit important 
properties of both. Some of these network properties can be quanti- 
fied by simple statistics such as the clustering coefficient C, which 
is a measure of local density, and the average shortest path length 
I, which is a global measure of separation. It has been shown in re- 
cent years that many social, biological, and man-made system show 
what has been called a small-world topology 1211 . in which nodes 
are highly clustered yet the path length between them is small. 

A second important aspect in the study of networks has been 
the realization that in many real-world networks, the distribution 
of the number of neighbours (the degree distribution) is typically 
right-skewed with a "heavy tail", meaning that most of the nodes 
have less-than-average degree whilst a small fractions of hubs have 
a large number of connections. These qualitative description can 
be described mathematically by a power-law (l], which has the 
asymptotic form p(k) ~ k~ a . This means that the probability of 
a randomly chosen point having a degree k decays like a power of 
k, where the exponent a (typically in the range [2, 3]) determines 
the rate of decay. A distinguishing feature of power-law distribu- 
tions is that when plotted on a double logarithmic scale, a power- 
law appears as a straight line with negative slope a. This behav- 
ior contrasts with a normal distribution which would curve sharply 
on a log-log plot, such that the probability of a node having a de- 
gree greater than a certain "cutoff value is nearly zero. The mean 
would then trivially represent a characteristic scale for the network 
degree distribution. Since networks with power-low degree distri- 
bution lack any such cutoff value, at least in theory, they are often 
called scale-free networks |20|. Examples of such scale-free net- 
works are the world-wide-web, the internet, scientific collaboration 
and citation networks, and biochemical networks. 

3. LANDSCAPES AS NETWORKS 

To model a physical energy landscape as a network, Doye |6| 



needed to decide first on a definition both of a state of the system 
and how two states were connected. The states and their connec- 
tions will then provide the nodes and edges of the network. For sys- 
tems with continuous degrees of freedom, the author achieved this 
through the 'inherent structure' mapping [ 18 1. In this mapping each 
point in configuration space is associated with the minimum (or 
'inherent structure') reached by following a steepest-descent path 
from that point. This mapping divides configuration into basins of 
attraction surrounding each minimum on the energy landscape. 

We use a modification of this idea for the NK family of bi- 
nary landscapes, which indeed can be applied to any combinato- 
rial landscape. In our case, the vertexes of the graph are the local 
maxima of the landscape, obtained exhaustively by running a best- 
improvement local search algorithm (see Algorithm 1) from every 
configuration of the search space. The edges in the network connect 
local optima of adjacent basins of attraction. An illustration for a 
model 2D landscape can be seen in FigureQ] which is inspired by a 
similar figure appearing in |5 6|. Here, we illustrate a network of 
local maxima (instead of local minima). A more formal definition 
of our inherent networks is given in Section [3T| As it was the case 
in the study on physical energy landscapes |6|, we do not consider 
multiple edges, or weights in the edges. This may be a factor to 
consider in future work. 




Figure 1: A model of a 2D landscape (left), and a contour plot of 
the local optima partition of the configuration space into basins 
of attraction surrounding maxima and minima (right). A sim- 
ple regular network of six local maxima can be observed. 



Note that while a physical energy landscape is formally a con- 
tinuous landscape, ours are strictly combinatorial, i.e. discrete and 
finite. Moreover, the energy landscape of a stable atomic cluster, 
crystal or molecule is relatively smooth and easy to search and has 
been called a "funnel" landscape |5|. In contrast, in NK land- 
scapes one can continuously vary the intrinsic landscape difficulty 
by changing the value of K. As a result, we shall see that NK 
landscapes show a number of different behaviors depending on K 
for a given N, and these different behaviors are reflected on their 
inherent networks. Indeed, NK landscapes can be seen as analo- 
gous to those of spin-glasses I11II17I . In contrast to atomic cluster 
energy landscapes, spin glass landscapes may show frustration, i.e. 
configurations that must respect conflicting constraints, and solving 
for the ground state of the system that is, the minimum energy con- 
figuration is an NP-hard problem. Similar consequences are caused 
by the introduction of epistatic interactions through the increase of 
the K value in NK landscapes. 

Below we present the relevant formal definitions and algorithms 
to obtain our combinatorial analogous of an energy landscape in- 
herent network. 

3.1 Definitions and Algorithms 



Definition : Fitness landscape. 
A landscape is a triplet (S, V, /) where S is a set of potential solu- 
tions i.e. a search space, V : S — ► 2 s , a neighborhood structure, 
is a function that assigns to every sGSa set of neighbours V(s), 
and / : S — > R is a fitness function that can be pictured as the 
height of the corresponding potential solutions. 

In our study, the search space is composed by binary strings of 
length N, therefore its size is 2 . The neighborhood is defined by 
the minimum possible move on a binary search space, that is, the 
1-move or bit- flip operation. In consequence, for any given string 
s of length TV, the neighborhood size is |V(s)| = N. Notice that 
in NK landscapes, two neighboring solutions never have the same 
fitness value. Therefore, neutrality is not present. Landscapes with 
neutrality will be considered in future work. 

Definition: Local Optimum. 
A local optimum is a solution s* such that Vs G V(s*), f(s) < 
/(«*)• 

The LocalSearch algorithm to determine the local optima and 
therefore define the basins of attraction, is given below: 



Algorithm 1 LocalSearch 
Choose initial solution s G S 
repeat 

choose s G V(s) such that f(s ) = max xeV (s) f{x) 
if/0) < /(s')then 

s <— s 
end if 

until s is a Local optimum 



The LocalSearch algorithm defines a mapping from the search 
space S to the set of locally optimal solutions 5™ . We therefore 
define a basin of attraction as follows: 

Definition : Basin of attraction. 
The basin of attraction of a local optimum i is the set bi = {s G 
S | Local Searches) = i}. The size of the basin of attraction of a 
local optima i is the cardinality of bi. 

We then define the inherent network, or network of local optima 

as: 

Definition : Local optima network. 
The local optima network G = (S*,E) is the graph where the 
nodes are the local optima, and there is an edge G E between 
two local optima i and j if there is at least a pair of direct neighbors 
(1-bit apart) Si and Sj, such that Si G h and Sj G bj. That is, if 
there exists a pair of direct neighbors solutions Si and Sj, one in 
each basin (bi and bj) 

4. EMPIRICAL NETWORK ANALYSIS 
4.1 Experimental Setting 

The NK family of landscapes 1111 is a problem-independent 
model for constructing multimodal landscapes that can gradually 
be tuned from smooth to rugged. In the model, N refers to the 
number of (binary) genes in the genotype (i.e. the string length) 
and K to the number of genes that influence a particular gene. By 
increasing the value of K from to A — 1, NK landscapes can be 
tuned from smooth to rugged. The k variables that form the context 
of the fitness contribution of gene s, can be chosen according to dif- 
ferent models. The two most widely studied models are the random 
neighborhood model, where the k variables are chosen randomly 
according to a uniform distribution among the n — 1 variables other 
than Si, and the adjacent neighborhood model, in which the k vari- 
ables that are closest to Si in a total ordering si, S2, ■ ■ ■ , s n (us- 



ing periodic boundaries). No significant differences between the 
two models were found in 1 1 1 1 in terms of global properties of the 
respective families of landscapes, such as mean number of local 
optima or autocorrelation length. Therefore, we explore here the 
adjacent neighborhood model, leaving the random model for future 
analysis. 

In order to avoid sampling problems that could bias the results, 
we used the largest values of N that can still be analyzed exhaus- 
tively with reasonable computational resources. We thus extracted 
the local optima networks of landscape instances with N = 16, 18, 
and K = 2,4,6, N - 2, N - 1. For each pair of N and K val- 
ues, 30 instances were explored. Therefore, the networks statistics 
reported below represent the average behaviour of 30 independent 
instances. 

4.2 General Network Statistics 

Table [TJ reports the average of the network properties measured 
on NK landscapes for N — 16, 18 and all even K values; K — 
N — 1 is also given. Values are averages over 30 randomly gener- 
ated landscapes. n v and n e are, respectively, the mean number of 
vertices and the mean number of edges of the graph for a given K 
rounded to the next integer. C is the average of the mean clustering 
coefficient^] over all the generated landscapes. CV is the average 
clustering coefficient of a random graph with the same number of 
vertices and mean degree, z is the average of the mean degrees. lis 
the average of the mean path lengths over all landscape instances. 
The last column contains the average degree assortativity coeffi- 
cient a, which measures whether nodes with similar degrees tend 
to pair up with each other. The assortativity coefficient is computed 
according to 1 14 1. 

Notice that the mean number of vertexes (n v ) confirms that the 
number of local optima (and thus the search difficulty) increases 
with the value of K. Some other interesting inferences can be 
drawn from these metrics. First of all, looking at the I values one 
can conclude that the maxima networks are small worlds for all val- 
ues of K since the growth of I is bounded by a function 0(log n v ). 
In a sense, this is not surprising as the whole configuration space 
spans the binary hypercube {0, 1}^ of degree N with 2 N vertices, 
which has maximum distance (diameter) d — log2 N , i.e. 16 and 
18 for our studied instances. However, while the base configuration 
space has constant degree for any node, the maxima network are 
degree-inhomogeneous (see next section) and have clustering co- 
efficients well above those of equivalent random graphs, showing 
that there is local structure in the networks. For both N = 16 and 
18, the mean degree z first increases with K and then goes down 
again for K > 8. The assortativity coefficients are always very 
small which means that there is almost no correlation between the 
degrees of neighboring nodes. For easy energy landscapes, Doye 
found that the networks were slightly disassortative (6). 

4.3 Degree Distributions 

The degree distribution function p(k) of a graph represents the 
probability that a randomly chosen node has degree k [14|. Ran- 
dom graphs are characterized by a p(k) of Poissonian form, while 
social and technological real networks often show long tails to the 
right, i.e. there are nodes that have an unusually large number of 

'The clustering coefficient d of a node i is defined as d = 
2Ei/ki(ki — 1), where Ei is the number of edges in the neigh- 
borhood of i. Thus d measures the amount of "cliquishness" of 
the neighborhood of node i and it characterizes the extent to which 
nodes adjacent to node i are connected to each other. The cluster- 
ing coefficient of the graph is simply the average over all nodes: 



Table 1: Network properties of TV A' landscapes for TV = 16, 18 and all even K values; A = TV — 1 is also given. Values are averages 
over 30 randomly generated landscapes, standard deviations are shown as subscripts. n v and n e represent the number of vertexes 
and edges (rounded to the next integer), C, the mean clustering coefficient, whilst C r is the clustering coefficient of a random graph 
with the same number of vertexes and mean degree, which is C r ~ z/n v . z represent the mean degree, Tthe mean path length , and 
a the degree assortativity coefficient. 
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Figure 2: Cumulative degree distributions for TV = 16 and K = 4 (top), K = 10 (bottom). All 30 curves are plotted. Left: Log-log 
plot. Right: Lin-log plot. 



neighbors. Sometimes this behavior can be described by a power- 
law, but often the distribution is less extreme and can be fitted by 
a stretched exponential or by an exponentially truncated power- 
law fl4l. 

Figure [2] shows all the curves for 30 randomly generated land- 
scapes for TV = 16 and A = 4, 10, whilst figure [3]does the same 
for TV = 18. To smooth out fluctuations in the high degree region, 
the cumulative degree distribution function is plotted, which is just 
the probability that the degree is greater than or equal to k. The sin- 
gle curves are shown rather than the average curve because the sum 
of a sufficient number of independent random variables with arbi- 



trary distributions, provided that the first few moments exist and are 
finite, tends to distribute normally according to a general formula- 
tion of the central limit theorem |7|. In other words, if the average 
of the sum were plotted, the original shapes would essentially be 
lost. The curves cannot be described by power-laws: this possibil- 
ity is ruled out by the left parts of figs. [2] and [3] which are double 
logarithmic plots. In log-log plots, power laws should appear as 
straight lines, at least for a sizable part of abscissae range. 

On the other hand, the right images in the same figures show that 
the distributions can be fitted approximately by exponentials of the 
type p(k) = (l/z)e~ h / z where z is the mean degree, as most 
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Figure 3: Cumulative degree distributions for N = 18 and K — 4 (top), K = 10 (bottom). All 30 curves are plotted. Left: Log-log 
plot. Right: Lin-log plot. 



curves are approximately straight lines on these linear-log plots. 
This is true for the larger part of the degree range. When we ap- 
proach the finite degree cutoff the fit is obviously less good. Small 
networks such as those with N = 16 and K = 4 show larger 
fluctuations and their tails decay faster than exponentially. Two 
particular examples with a medium value of K (K = 8) are shown 
in detail in fig. [4] together with an exponential fit. Table[2]gives the 
parameters of the regression lines for all N and K values. 
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Figure 4: Cumulative degree distribution (with regression line) 
of two representative instances with K = 8, N = 16 (top) and 
N = 18 (bottom). 



Table 2: Correlation coefficient (p), intercept (a) and slope (/?) 
and slope of the linear regression between the cumulative num- 
ber of nodes and the degree of nodes : log(p(fc)) = a + /3k + e. 
The averages and standard deviations of 30 independent land- 
scapes, are shown. 
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If we compare these results with Doye's (5] |6) the most impor- 
tant difference is that we do not observe power-law distributions. 
Indeed, power-law degree distributions of the inherent energy land- 
scape networks point to the "easiness" of those landscapes: due to 
the presence of highly connected nodes, which are also among the 
fittest, a simple gradient-descent would bring a searcher down to 
a local energy minimum, often the global one, starting anywhere 
in the configuration space. In other words, there exist the "funnel" 
effect described by Doye |5|. In contrast, NK landscapes have 
tunable difficulty. How can random networks with exponential de- 



gree distributions be obtained? One way is the following: in each 
time step, just add a new node, and add a new link between two 
randomly chosen nodes, including the new one. Iterating this dy- 
namical process produces graphs with an exponential distribution 
of the node degrees |4|. But NK landscapes are static and thus it is 
difficult to see how this process could be implemented. However, 
the following qualitative explanation might help. Imagine that K is 
increased from 2 to TV — 1 in single steps. Then we could have the 
image of the previous landscape increasing its size and deforming 
itself when K goes from its current value to K + 1. The new max- 
ima that appear could be considered as if they were added dynam- 
ically (of course some previous optima might disappear as well). 
Edges in the new landscape are selected essentially randomly, with 
more probability of selecting an already existing node. Thus, with 
this imaginary mechanism a distribution close to exponential would 
be obtained. 

Thus, as the NK landscape difficulty varies smoothly when K 
is increased, the degree distribution of the corresponding maxima 
networks remains essentially exponential. We do not observe scale- 
free distributions for the easy landscapes as in the energy landscape 
case 0. This is understandable: standard energy landscapes in 
molecular chemistry and crystal physics do correspond to thermo- 
dynamically stable states which are naturally smooth and easy to 
reach when the system is forming or it is slightly perturbed. In 
contrast, NK landscape are synthetic and do not correspond to any 
physical principle in their construction. The only physical systems 
that resemble NK landscapes are spin glasses, in which conflicting 
energy minimization requirements lead to frustration and to land- 
scape ruggedness I11II17I . However, disordered condensed matter 
systems similar to spin glasses are only obtained in particular situ- 
ations, for instance by fast cooling 1 17 1. 

4.4 Basins of Attraction 

Besides the maxima network, it is useful to describe the asso- 
ciated basins of attraction as these play a key role in search al- 
gorithms. Furthermore, some characteristics of the basins can be 
related to the network features described above. The notion of 
the basin of attraction of a local maximum has been presented in 
sect. 13.11 We have exhaustively computed the size and number of 
all the basins of attraction for N — 16 and TV = 18 and for all even 
K values plus K = N — 1. In this section, we analyze the basins 
of attraction from several points of view as it is described below. 

4.4.1 Global optimum basin size vs. K 
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In Figure [5] we plot the average size of the basin corresponding 
to the global maximum for TV = 16 and TV = 18, and all values 
of K studied. The trend is clear: the basin shrinks very quickly 
with increasing K. This confirms that the higher the K value, the 
more difficult for an stochastic search algorithm to locate the basin 
of attraction of the global optimum 

4.4.2 Number of basins of a given size 
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Figure 5: Average of the relative size of the basin corresponding 
to the global maximum for each K over 30 landscapes. 



Figure 6: Cumulative distribution of the number of basins of a 
given size with regression line. Two Representative landscapes 
are visualized with N=16 (top) and N=18 (bottom) and K=4. A 
lin-log scale is used. 



Figure [6] shows the cumulative distribution of the number of 
basins of a given size (with regression line) for two representative 
instances with K = 4 and TV = 16 (top) and TV = 18. Table [3] 
shows the average (of 30 independent landscapes) correlation co- 
efficients and linear regression coefficients (intercept (a) and slope 
(/?)) between the number of nodes and the basin sizes. Notice that 
distribution decays exponentially or faster for the lower K and it 
is closer to exponential for the higher K. This observation is rele- 
vant to theoretical studies that estimate the size of attraction basins 
(see for example [8]). These studies often assume that the basin 
sizes are uniformly distributed. From the slopes (3 of the regression 
lines (table|3j one can see that high values of K give rise to steeper 
distributions (higher /3 values). This indicates that there are less 
basins of large size for large values of K. In consequence, basins 
are broader for low values of K, which is consistent with the fact 
that those landscapes are smoother. 

4.4.3 Fitness of local optima vs. their basin sizes 

The scatter-plots in figure|7]illustrate the correlation between the 
basin sizes of local maxima (in logarithmic scale) and their fitness 
values. Two representative instances for TV = 18 and K = 4,8 are 
shown. Table |4]shows the averages (of 30 independent landscapes) 
of the correlation coefficient, and the linear regression coefficients 
between these two metrics (maxima fitness and their basin sizes). 



Table 3: Correlation coefficient (p), and linear regression co- 
efficients (intercept (a) and slope (/3)) of the relationship be- 
tween the basin size of optima and the cumulative number of 
nodes of a given (basin) size ( in logarithmic scale: log(p(s)) = 
a + (3s + e). The average and standard deviation values over 30 
instances, are shown. 



N = 16 


K 


P 


a 


p 


2 


-0. 944 .0454 


2.89o.673 


-0.0003o.ooo2 


4 


-0.959o.osio 


4.19 .554 


-0.0014o.0006 


6 


-0.967o.o2so 


5.09o.504 


-0.0036o.ooio 


8 


— 0.982o.oii6 


5.97o.32i 


-0.0080o.ooi3 


10 


-0.985o.oiei 


6.74 .392 


-0.0163o.oo25 


12 


-0. 990o.oo88 


7.47 .346 


-0.0304o.0042 


14 


-0.994o.oo59 


8.08o.241 


-0.0508o.oo48 


15 


— 0.995o.oo44 


8.37o.240 


-0.0635o.0058 


N = 18 


2 


-0.959o.0257 


3.18o.696 


-0.0001o.oooi 


4 


-0.960o.o409 


4.57 .617 


-0.0005o.ooo2 


6 


-0.967o.0283 


5.50o.520 


-0.0015o.ooo4 


8 


— 0.977o.0238 


6.44 .485 


-0.0037o.ooo7 


10 


-0.985o.oi4i 


7-24o.372 


-0.0077o.oon 


12 


-0.989o.oi29 


7.98o.370 


-0.0150o.ooi9 


14 


-0.993o.oo72 


8.69o.276 


— 0.0272o.oo24 


16 


-0.995o.oo56 


9.33o.249 


-0.0450o.0036 


17 


-0. 992 .oii3 


9.49o.386 


-0.0544o.0058 



Table 4: Correlation coefficient (p), and linear regression coef- 
ficients (intercept (2) and slope (/3)) of the relationship between 
the fitness of optima and their basin size (in logarithmic scale: 
log(s) = a + [3f + e). The average and standard deviation 
values over 30 instances, are shown 



N = 16 


K 


P 


a 


p 


2 


0.832o.0879 


-15. 476 5 . 9401 


33.0668.9252 


4 


0.842o.0259 


-13.035i.9907 


27.0942.86ii 


6 


0.852o.oi80 


— 12.977o.9921 


26.061i.4908 


8 


0.8600.0088 


-12.570o.3769 


24.880o.5725 


10 


0.850 .oo50 


-11. 954 .3501 


23.561o.5421 


12 


0.833o.oo65 


-11. 485 .2993 


22.519o.4773 


14 


0.8160.0047 


— ll.261o.2008 


21.864o.3256 


15 


0.812q.0044 


— 11.352q.2109 


21.8760. 3298 


N = 18 


2 


0.839o.o680 


-16.5856.0606 


35. 925s. 6640 


4 


0.842 .0257 


-14.4582.1746 


30.174 3 .1520 


6 


0.852o.oi40 


-14.542o.9596 


29.219i.4i47 


8 


0.867o.oo66 


-14.515o.3750 


28. 5380.5988 


10 


O.8660. 0038 


-13. 914 .3068 


27.209o.462i 


12 


0.854 .oo30 


-13. 18O0.1700 


25.751o.2804 


14 


0.836o.oo27 


-12.602o.l399 


24.553o.22i4 


16 


0.822o.oo22 


-12.502o.l039 


24.133o.1633 


17 


0.817o.0027 


— 12.583q.1278 


24.143o.2066 



exp. 
regr. line 



,.{?Z f J0^% ******* 

» ~ :ji!i?S^ ********"* * • 



0.56 0.58 0.6 0.62 0.64 0.66 0.68 0.7 0.72 0.74 0.76 0.78 
fitness of local optima 

: ' regr. line : 




0.5 0.55 0.6 0.65 0.7 0.75 0.E 
fitness of local optima 



Figure 7: Correlation between the fitness of local optima 
and their corresponding basin sizes, for two representative in- 
stances with N = 18, K = 4 (top) and K = 8 (bottom). 



All the studied landscapes for N = 16 and 18, are reported. Notice 
that, there is a clear positive correlation between the fitness values 
of maxima and their basins' sizes. In other words, the higher the 
peak the wider tend to be its basin of attraction. Therefore, on 
average, with a hill-climbing algorithm, the global optimum would 
be easier to find than any other local optimum. This may seem 
surprising. But, we have to keep in mind that as the number of local 



optima increases (with increasing K), the global optimum basin 
is more difficult to reach by an stochastic local search algorithm 
(see figure [5). This observation offers a mental picture of NK 
landscapes: we can consider the landscape as composed of a large 
number of mountains (each corresponding to a basin of attraction), 
and those mountains are wider the taller the hilltops. Moreover, the 
size of a mountain basin grows exponentially with its hight. 

4.4.4 Basins sizes of local optima vs. their degrees 

The scatter plots in figure [8] illustrate the correlation between 
basin sizes of maxima and their degrees. Representative instances 
with TV = 18, and K = 4,8, are illustrated. There is a clear pos- 
itive correlation between the degree and the basin sizes of maxima 
in the network. This observation suggests that landscapes with low 
K values can be searched more effectively since a given config- 
uration has many neighbors belonging to the same large basin of 
attraction. It is also confirmed that the basins for low K are much 
larger than those for high K, not only the basin corresponding to 
the global maximum. 

5. CONCLUSIONS 

We have proposed a new characterization of combinatorial fit- 
ness landscapes using the well-known family of NK landscapes as 
an example. We have used an extension of the concept of inher- 
ent networks proposed for energy surfaces 1 5 1 in order to abstract 
and simplify the landscape description. In our case the inherent 
network is the graph where the vertices are all the local maxima 
and edges mean basin adjacency between two maxima. We have 
exhaustively obtained these graphs for N = 16 and TV = 18, and 
for all even values of K, plus K = N — 1. The maxima graphs 
are small worlds since the average path lengths are short and scale 
logarithmically in the size of the graphs. However, the maxima 
graphs are not random. This is shown by their clustering coeffi- 
cients, which are much larger than those of corresponding random 
graphs and also by their degree distribution functions, which are not 
Poissonian but rather exponential. The construction of the maxima 
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Figure 8: Correlation between the degree of local optima 
and their corresponding basin sizes, for two representative in- 
stances with with N = 18, K = 4 (top) and K = 8 (bottom). 



networks requires the determination of the basins of attraction of 
the corresponding landscapes. We have thus described the nature 
of the basins and their relationship with the local maxima network. 
We have found that the size of the basin corresponding to the global 
maximum becomes smaller with increasing K. The distribution of 
the basin sizes is approximately exponential for all N and K , but 
the basin sizes are larger for low K, another indirect indication of 
the increasing randomness and difficulty of the landscapes when 
K becomes large. Finally, there is a strong positive correlation be- 
tween the basin size of a maxima and their degree, which confirms 
that the synthetic view provided by the maxima graph is a useful 
one. 

This study represents our first attempt towards a topological and 
statistical characterization of easy and hard combinatorial land- 
scapes. Much remains to be done. First of all, the results found 
should be confirmed for larger instances of NK landscapes. This 
will require good sampling techniques, or theoretical studies since 
exhaustive sampling becomes quickly impractical. Other landscape 
types should also be examined, such as those containing neutral- 
ity, which are very common in real- world applications. Work is in 
progress for neutral versions of NK landscapes. Finally, the land- 
scape statistical characterization is only a step toward implement- 
ing good methods for searching it. We thus hope that our results 
will help in designing or estimating efficient search techniques and 
operators. 
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